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We consider the task of estimating the randomly fluctuating phase of a continuous-wave beam of 
light. Using the theory of quantum parameter estimation, we show that this can be done more accu- 
rately when feedback is used (adaptive phase estimation) than by any scheme not involving feedback 
(non-adaptive phase estimation) in which the beam is measured as it arrives at the detector. Such 
schemes not involving feedback include all those based on heterodyne detection or instantaneous 
canonical phase measurements. We also demonstrate that the superior accuracy adaptive phase 
estimation is present in a regime conducive to observing it experimentally. 
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I. INTRODUCTION 

Phase is a physical property found in both classical and 
quantum electromagnetic (EM) fields. For classical EM 
fields comprising a single mode, it can be determined ex- 
actly via measuring two orthogonal quadratures or com- 
ponents of such fields. This, however, is not the case 
for single-mode EM fields in quantum mechanics. Esti- 
mates of the phases of such fields are necessarily imper- 
fect due to intrinsic quantum noise in measurements of 
non-commuting observables such as quadratures. Given 
this limitation, quantum phase estimation, the process of 
estimating the phase of a quantum-mechanical EM field 
as accurately as possible, is non-trivial. 

In addition to being non-trivial, phase estimation in 
quantum mechanics is interesting for a number of rea- 
sons. First, at some time in the future it may be prac- 
tical to encode and send information in the phase of a 
single electromagnetic field mode at or near the ultimate 
quantum limit — the upper limit permitted by quan- 
tum mechanics 0, 0, H ]- I n such a scenario, the more 
accurately a receiver could estimate phase the more in- 
formation could be sent. Second, it also may be useful 
in interferometric gravity-wave detection. Third, phase 
estimation is interesting as it is an instance of quantum 
parameter estimation jjj, |(| , an increasingly experimen- 
tally accessible field concerned with estimating parame- 
ters of quantum states as well as possible in the face of 
unavoidable quantum noise. 

Phase can be estimated via two broad approaches, non- 
adaptive phase estimation and adaptive phase estimation 
0. In non-adaptive phase estimation, which is the con- 
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ventional approach, we measure an EM field via a single 
fixed measurement that remains constant over time. In 
adaptive phase estimation, however, the measurement is 
continually adjusted in an attempt to maximise its accu- 
racy at each moment in time. This is done by changing 
or adapting it based on earlier measurement results. For 
both EM-field pulses and also continuous EM beams, it 
has been shown that adaptive phase estimation is more 
accurate than (at least) many instances of the conven- 
tional non-adaptive approach 0, 0> H 113 • 

In this paper we consider the problem of estimating the 
randomly fluctuating phase of a continuous-wave (cw) 
EM field (EM beam) as introduced in Ref. 0. We 
show that this can be done more accurately using adap- 
tive phase estimation than via any non-adaptive phase 
estimation scheme in which the field is measured in real 
time (that is, as it arrives at the detector). We also show 
that this improved accuracy exists for fields with small 
to moderate photon fluxes. These are our two main re- 
sults. The latter is significant, first, as a theoretical differ- 
ence between the accuracies of adaptive and non-adaptive 
phase estimation is most readily seen experimentally in 
fields with small to moderate photon fluxes. Second, in 
a communication scenario in which a receiver is trying 
extract information encoded in the phase of an EM field 
by a distant sender, it is likely that the receiver will be 
making measurements on fields with small to moderate 
photon fluxes due to attentuation Q. In the course of 
arriving at the two results, we present a theoretical tech- 
nique for estimating phase that may be applicable to a 
range of problems. Our results build upon earlier work 
[E IE El an d further demonstrate the superiority of 
adaptive scheme over conventional non-adaptive ones for 
the important task of phase estimation. 

This paper is structured as follows: In Section [HI we 
review the mathematical tools used throughout. They 
are Bayes' rule, the Kushner-Stratonovitch equation and 
the Zakai equation. Next, Section 1TTT1 presents the phase 



estimation schemes considered, some of which are adap- 
tive and some of which are non-adaptive. In Section ll VI 
we compare the accuracies of the schemes in the steady- 
state regime, showing that each of the adaptive schemes 
is more accurate than all of the non-adaptive schemes. 
Finally, in Section we discuss our results. 

Before proceeding further, we first review existing work 
on adaptive phase estimation. As previously stated, the 
conventional method for estimated the phase of an EM 
field is via non-adaptive phase estimation. For a single- 
mode EM-field pulse in the coherent state \/3), where f3 e 
C, the most widely known method @,||J of estimating the 
phase 4>(=a,vg(0j) uses a non-adaptive detection tech- 
nique called heterodyne detection [3. H3. Il3l IT l lT5 l ll6 | . 
This involves mixing the pulse, which we call the sig- 
nal pulse, with an intense local oscillator of phase $ = 
$0 + At at a 50:50 beamsplitter. Here A is a detuning, 
t denotes time and <&o is the phase at t = 0. The differ- 
ence between the photocurrents in the beamsplitter's two 
output ports is proportional to the quadrature phase am- 
plitude A$ — ae~ l * + a^e 1 *, where a and are creation 
and annihilation operators for the signal pulse. Assum- 
ing that A ^> r, where V is the signal pulse's spectral 
width, all quadratures are rapidly measured and thus, 
for all practical purposes, heterodyne detection instanta- 
neously measures the complex photocurrent I c contain- 
ing equal information about the observables X$ = o and 
A$ =7r /2- Once the signal pulse has been measured, <fi can 
then estimated from an appropriate functional of all the 
recorded currents. For large values of \(3\ this approach 
leads to an estimate with a variance of 1/ (2|/3| 2 ) Half 
of this is non-fundamental and results from excess noise 
introduced by heterodyne detection due to the fact that 
it measures two noncommuting quadratures. This ex- 
cess contribution to the variance can also be thought of 
as arising from the fact that heterodyne detection mea- 
sures all quadratures equally. Because of this, it some- 
times measures some the so-called amplitude quadrature 
(X<$ =l fi) which contains no information about <fi. 

A second type of EM field for which phase estimation 
has been considered is a continuous EM beam. In partic- 
ular, Ref. |l0l | considered such estimation for a continu- 
ous beam in a coherent state with phase <p that randomly 
fluctuated in time as a Wiener process [ll]]. This paper 
found that one particular non- adaptive phase estimation 
scheme estimated <f> with a variance of l/\/2N in the 
steady-state regime for N 3> 1. Here, N is the beam's 
photon flux in an amount of time equal to its coherence 
time (which is set by the timescale of the fluctuations in 

Though non-adaptive phase estimation using hetero- 
dyne detection yields a reasonable estimate of <j> for both 
a single EM-field pulse and a continuous EM beam, this 
quantity can be more accurately estimated via adaptive 
techiques 0, [E 113 For a single pulse of light, again 
in the coherent state |/3), this can be done by measur- 
ing the field using adaptive homodyne detection. Non- 
adaptive homodyne detection is identical to heterodyne 
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FIG. 1: Schematic diagram of the measurement set-up for 
adaptive homodyne-based phase-estimation schemes. The 
symbol BS denotes the 50:50 beamsplitter, D_ and D+ are 
photon counters for which the difference in the number of 
photons they detect is found and then fed back to the local 
oscillator's phase. A signal processor is denoted by SP. 



detection except that the local oscillator has the same 
frequency as the pulse's mean frequency so that <I> is 
constant ^t|- ^ i s m ade adaptive by varying <I> so as 
to try to measure the so-called phase quadrature. This 
is the quadrature for which <I> — <fi + tt/2, and, more- 
over, the one that minimises the measurement's excess 
uncertainty, below that of heterodyne detection. To try 
to measure the phase quadrature we use the results of 
previous measurements to obtain (j>fb(t), an estimate for 
4>(t). This is then fed back to the local oscillator and <f> is 
set to = <pfb(t) + tt/2 in an attempt to 'home in' on 
the phase quadrature. Fig. ^ shows a schematic diagram 
of the apparatus implementing this scheme. When |/3| is 
large, it leads to a variance in our estimate of 1/ (4|/3| 2 ) 
7] , which is only half as large as that of the non-adaptive 
scheme discussed above. Furthermore, this improved ac- 
curacy has been seen experimentally Q- 



For the continuous EM beam with a randomly fluctu- 
ating phase considered earlier, it is known that a particu- 
lar adaptive scheme is more accurate than one particular 
non-adaptive one [lOj. But is it also more accurate than 
the best possible non-adaptive scheme? One of main re- 
sults of this paper is to show, in Section ITvl that in the 
steady-state regime adaptive phase estimation is more ac- 
curate than any non-adaptive estimation scheme in which 
the EM field is measured in real time, even one involv- 
ing a canonical phase measurement |2q |. In addition, we 
show that the improved accuracy of adaptive phase esti- 
mation persists for N -C 1. 
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II. BACKGROUND THEORY 
A. What is phase? 

Within quantum mechanics, the term 'phase' has mul- 
tiple meanings 181 l25|. In this paper, however, it refers 
to a single concept which we now state. The electric field 
of a classical single-mode EM-field pulse incident on an 
ideal photodetector is, in the vicinity of this detector, 

*> = ^('*-<"— » + c-c). (2.1) 

Here, t denotes time, u is the field's angular frequency, eg 
denotes the permittivity of free space, A is the transverse 
area over which the field is spread, e represents a unit 
vector denoting the field's direction, \a\ is a complex am- 
plitude with dimensions of time -1 / 2 , c denotes the speed 
of light and c.c. represents a complex conjugate. Given 
this, we define <f> c \. to be this field's phase. Similarly, 
the phase of a quantum-mechanical single-mode EM-field 
pulse is defined to be the quantum-mechanical analogue 
of C L, which we denote by <fi. For instance, the phase of 
the coherent state ||/3|e 4 ^) is defined to be cj) which is a 
parameter and not an observable. In particular, it is not 
the observable associated with the Pegg-Barnett phase 
operator p| which is also called phase but which docs 
not have a well-defined value for the state 1 e z<?i ) . 

B. Continuous EM beam 
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FIG. 2: Schematic diagram of the physical scenario consid- 
ered. First, an idealised cw laser outputs a continuous beam 
of light which is then incident on an electro-optical modulator 
(EOM). The EOM imprints phases on segments of the beam 
which are then incident on the detection set-up on the right. 

Here dW is a real Wiener increment, r\ is the detector's 
efficiency (which is its probability of detecting an inci- 
dent photon) and <& is the local oscillator phase. In con- 
trast, heterodyne detection simultaneously measures the 
quadratures A$ = o and X$— w / 2 ■ An alternate way of do- 
ing this is to first split the signal beam at a 50:50 beam- 
splitter and then to measure X$ = $ at one output and 
X<f>=T,/2 at the other. Assuming perfect detectors, each 
photodetector measures, on average, half of the beam's 
photons and thus the quantum efficiency of each mea- 
surement is r\ = 1/2. Representing both outcomes in 
terms of a single complex quantity, we obtain 

I c dt = \a\e i4, dt + dW c , (2.5) 

where dWc is a complex Wiener increment defined by 
the correlations (dW c dW*) = dt and (dW c dW c ) = 0. 



The scenario which we consider throughout this paper 
centres around a continuous EM beam 10] known as the 
signal beam. This beam is the output of an idealised laser, 
and so can be described by a coherent state with complex 
amplitude a. The mean photon flux \a\ 2 is constant. 
However, the beam's phase (f>(t) fluctuates randomly such 
that, again in the vicinity of the detector, 



(2.2) 



Here n is a noise strength and £ is real Gaussian white 
noise defined by 



<£(*)£(*')> = <5(*-0- 



(2.3) 



In practice, this fluctuation could be achieved via an 
electro-optical modulator (EOM) that 'imprints' a 
fluctuating phase on each segment of the beam. These 
phase fluctuations give the beam a linewidth of k, so that 
N = \a\ 2 1 k is the number of photons in the coherence 
time. 

In the continuous EM beam scenario, we measure the 
signal beam via either homodyne or heterodyne detec- 
tion. For homodyne detection, the photocurrent I T mea- 
sured in the interval dt is given by 

I t dt = 2ryRe (la^-^) dt + ^dW. (2.4) 



C. Non-adaptive and adaptive phase estimation 

In a number of the phase-estimation schemes we con- 
sider, 4> is estimated using the theory of quantum param- 
eter estimation @. This process involves two steps. 
First, Bayes' rule is used to obtain a differential equation 
with respect to time for P(<j>), the probability distribution 
encoding our knowledge of <j), which we then solve. Bayes' 
rule updates our knowledge of some unknown parameter 
given the measurement result M. For the situations we 
consider, it is 



P(</>)P(M\<t>) 
P{M) 



(2.6) 



where P(x\y) denotes the probability of x given y. The 
second step in the process of estimating <f> via quantum 
parameter estimation is to use P(4>) to calculate our es- 
timate of 4>, which we denote by 4>(t). 

To explain in more detail the first step of generating 
and solving a differential equation for P(<fi), we begin by 
observing that in Eq. i|2.6[l the term P{M) is a normali- 
sation factor that ensures the normalisation of P(<f>\M). 
This can be seen by realising that we can write P(M) as 



P(M) = 



-2tt 



P{<t>)P{M\4>)dct>, 



(2.7) 
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where <po is an arbitrary lower limit. It follows from this 
that upon replacing P(M) in Eq. I|2.6I) by another func- 
tion of M that is independent of (f> we obtain a quasi- 
Bayes' rule that updates an unnormalised 'probability' 
distribution for <fr that we label P(<j>) |5|- We choose 
to replace P(M) by P(M)\ a \= Q , where P(M)\ a \= is the 
probability of measuring the result M given \a\ = 0, and 
so Eq. H2.6fl becomes 



P(</>\M) = 



P(4>)P(M\0) 
P(M) H=0 • 



(2.8) 



The function P(M)i a i =0 was chosen as it corresponds to 
considering the measurement result M in the denomina- 
tor to be Gaussian white noise which, in turn, simplifies 
Eq. (|2.ti|) . Furthermore, it yields a liner evolution equa- 
tion for P{4>). This is in contrast to the non-liner one for 
P(4>) that would have been obtained had P(M) not been 
replaced. 

The next step in obtaining and solving a differential 
equation for P(<fi) is to transform Eq. (|2.8|l into the form 



dP(0) = {f{4>)g{M) + C .c.)P{<t>)dt, 



(2.9) 



where f(<fi) and g(M) are functions whose nature depends 
upon P(M\4>) and P(M)\ a i—o, by neglecting terms of or- 
der dt 2 or higher. This equation is known as a Zakai 
equation |2l| . To obtain the desired differential equation 
for P(4>) with respect to time from Eq. I|2.9|l we normalise 
P(4>) using a known procedure detailed in Appendix A. 
This leads to the following differential equation for P(4>): 



dP(<p) 



[(« 



,*0 



(e*)p W )P(#C(t)+ac.Jdi, 



(2.10) 



where £ is either real or complex Gaussian white noise de- 
pending on the nature of M. This is known as a Kushner- 
Stratonovitch (KS) equation |22^ . 

Thus far, we have only considered the evolution of P(cj>) 
due to our measurement of the signal beam. However, 
there is also its evolution resulting from the diffusion de- 
scribed by Eq. I|2.2|l . In the absence of measurement, this 
diffusive evolution leads to P(4>) being a Gaussian distri- 
bution centred on <j>(t — 0) with variance nt. A straight- 
forward calculation shows that the evolution equation for 
this distribution in this case is the Fokker-Planck equa- 
tion 



(2.11) 



Adding the effects of phase diffusion to Eq. (|2.2l)|l leads 
to the final KS equation 



dP 



nd 2 P 



dt + \a\ [(e*+ - (e^)p W P((t) + c.c] dt. 

(2.12) 



Solving this equation we obtain P ((/>). 



As stated at the start of this subsection, the second 
step in estimating <p(t) via quantum parameter estima- 
tion is to calculate the optimal estimate for </>(i) from 
P{<f)- This is defined to be the estimate with the follow- 
ing two properties: 

1. It has the smallest possible average error as mea- 
sured by the Holevo variance [2fT |. 

2. It is such that (exp[i(0 — 4>)])i£ € K+. Here (. . .)/^ 
is an average over / and £, where / is either I c or 
J r depending on the measurement scheme. 

The Holevo variance is a measure of statistical spread 
suitable for any cyclical variable x and is given by 



V H (x) = \(e ix )\- 



1. 



(2.13) 



For such variables, it is superior to the standard vari- 
ance a 2 as the latter can be ill-defined. To illustrate this 
problem, observe that <fi has the range [cf>o, <^o+27r), where 
4>o is usually chosen to be either — n or 0. As a result, 
depending on our choice of <f>o, cr 2 (4>) can take different 
values for a single distribution. The reason for the sec- 
ond property is to rule out estimates with small Holevo 
variances but which are systematically biased and hence 
do not estimate 4> accurately. 

The optimal estimate we wish to calculate is given by 



</>(£) = arg ((exp(i<f>))p W ) 



(2.14) 



where (. . .) p/^-s denotes an average over P((f>). Whilst the 
estimate {4>) p^ is a more obvious choice for the optimal 
estimate of <fi(t), it sometimes estimates <p(t) poorly due 
to the fact that <j)(t) is a cyclical variable. This occurs, for 
instance, when P{4>) is centred near (j) . It is important 
to realise that the estimate in Eq. (|2.14|) is not optimal 
in an absolute sense. Rather, it is the best estimate of <f> 
given that we have chosen to minimise the 'cost function' 

It is interesting to note that the approach to estimat- 
ing <fi(t) outlined above differs from that in other work 
on phase estimation |E S H El- These other papers 
generated estimates based on intuitive, partially justified 
mathematical functions and, as a consequence, their esti- 
mates were sometimes sub-optimal. In contrast, a num- 
ber of this paper's phase-estimation schemes use quan- 
tum parameter estimation which leads to optimal esti- 
mates for 4>(t) (at least according to the cost or error 
function V H ((j))). 

To illustrate our method of obtaining cf>(t) via quantum 
parameter estimation, we now demonstrate its applica- 
tion in the case of measuring the signal beam via hetero- 
dyne detection. (Its use in the other cases we consider is 
very similar.) For this type of detection, Bayes' rule is 



(2.15) 
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Replacing the normalisation constant P(I C ) by P(I c )\a\=o 
leads to the quasi-Bayes' rule 



PI 



(2.16) 



^(ic)|a|=0 

Eq. I|2.5[l tells us that the real and imaginary parts of I c 
are Gaussian random variables with variances of 1/ (2dt) 
and, respectively, means of |a|cos0 and |a|sin0. From 
this it follows that 

dt 
n 



P(Ic 



= — exp [-<£*{ (Re(I c ) - \a\ cos(f>) 2 

T 

-(Im(/ c )-|a|sin0) 2 }/2] (2.17) 



whilst 



P(/e)|a|=o = 7 exp(-y7*J c ). (2.18) 

Substituting the expressions on the right-hand side of 
Eqs (|2.17(l and l|2.18|l into Eq. (|2.16(1 and neglecting terms 
of order dt 2 or higher leads to the Zakai equation 

(2.19) 



dP((f>) = |a|(e i0 / c + c.c.)P((j))dt. 



Normalising P via the known procedure detailed in 
Appendix A, from Eq. (|2.19H we obtain the Kushncr- 
Stratonovitch (KS) equation |23 

dP{4>) = \a\ [(e^ - (e^) PW )P(^)C(t) + c.c] dt, 

(2.20) 

where C is complex Gaussian white noise (£ = I c — 



M(e )pfrh\ and is so-called observation or measurement 
noise 23]). Incorporating the effects of phase diffusion, 
we arrive at 



dP(<t>) 



k d 2 P{<f>) 



dt 



[(■ 



,i<t> 



'>)p W )P(^C(*) + c.c.]dt. 



(2.21) 



Note that this equation has been previously derived, al- 
beit for a different (but related) physical system via a 
different method |2J|. It is also interesting to realise 
that we could have obtained Eq. i|2.21[l via beginning 
with Eq. (|2.ti|) . substituting into it expressions for P(I c \cf>) 
and P(I C ), and performing some algebra whilst neglect- 
ing terms of order dt 2 or higher. Though this method is 
conceptually simpler than the one we used, it involves a 
more challenging calculation. To complete the process of 
determining cf>, once we have obtained Eq. (|2.21|) we solve 
it and then use P((f>) to calculate cf>(t) via Eq. I]2.14[l . 



III. PHASE ESTIMATION SCHEMES 

In this paper we compare the accuracies of a number of 
non-adaptive and adaptive phase estimation schemes for 
an EM beam. Prior to doing so, however, we outline the 
schemes considered, detailing non-adaptive and adaptive 
schemes in turn. These are summarised in Table [I] 



A. Non-adaptive schemes 

1. Berry-Wiseman (BW) heterodyne-based 

In the Berry-Wiseman (BW) heterodyne-based phase- 
estimation scheme |l0j the signal beam is measured via 
heterodyne detection. The phase estimate at time t, (j>(t), 
is then calculated from the measurement record up to t. 
Specificially, it is 



(f>{t) = arg (A t ) , 



where At can be written as 



A t = 



due x ^-^I c (u), 



(3.1) 



(3.2) 



where x is a scaling parameter. More specifically, x 
scales the weight exp(— x(u — t)) given to each current 
I u . Whilst this estimate may not seem intuitive, it was 
chosen as an analogous estimate for the single-shot sce- 
nario was known to be accurate 0. Moreover, Ref. 
showed that, for large TV, argj4 t was an accurate estimate 
for a continuous EM beam when x was set to x — 2\a\y/K. 



2. Optimal heterodyne-based 

In this scheme, the signal beam is measured via het- 
erodyne detection and then, following the calculation in 
Section m quantum parameter estimation is used to ob- 
tain the KS equation Eq. I|2.20|l . This is then solved 
and its solution used to obtain <j)(t) in accordance with 

Eq. 1233 • 



3. Canonical 

The canonical phase estimation scheme involves mak- 
ing a canonical phase measurement |25| on the signal 
beam at each instant in time and then taking 4>(t) to 
be its outcome. Naively, it might be thought that this 
scheme would be more accurate than any other as a 
canonical measurement, or so it is thought, is the most 
accurate measurement of phase one can make. Results in 
Section Hvl show, however, that this is not the case (for 
reasons explained in Section El - 



B. Adaptive schemes 

1. Simple adaptive 

In the simple adaptive phase-estimation scheme |Toj we 
measure the signal beam via adaptive homodyne detec- 
tion and then estimate <p(t) to be 



0(t) = arg (A t ), 



(3.3) 
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where here 

A t = f d«e x ( u -*)e**/ r (u). (3.4) 

J u— — oo 

We also adapt the homodyne measurement, setting the 
local oscillator's phase to <f>(t) = <p(t) + tt/2. From this it 
follows ^3 that it is updated such that its rate of change 
with time is 

— = yfiil t {t). (3.5) 

This equation follows from letting x — ^lal^/n in 
Eq. 1)3.4)1 which is known to be optimal for large N [TfT |. 
One of the reasons the simple adaptive scheme was con- 
sidered in Ref. [ljj was that the fact that for large N it 
was known to be optimal. In Section ll VI we show that it 
also performs well for small to moderates values of N. 

2. Berry-Wiseman (BW) adaptive 

The Berry-Wiseman (BW) adaptive phase-estimation 
scheme involves measuring the signal beam via adaptive 
homodyne detection. The phase estimate at time t, 4>(t), 
is then a function of two functionals of all measurement 
results up to time t. Specifically, it is 

$(t) = arg (A t + X B t A$), (3.6) 

where A t is as defined in Eq. 1)3. 2)1 and B t is 

B t = f due x{u - t] e 2l,s,{u) . (3.7) 

J u— — oo 

As for the BW heterodyne-based scheme, this estimate 
was chosen as an analogous estimate for the single-shot 
case was known to be accurate Q • Furthermore, Ref. pH | 
showed that it was accurate, for large N, for \ = 2\a\y/~K 

3. Semi-optimal adaptive 

In the semi-optimal adaptive scheme for phase estima- 
tion, we assume it is optimal to always measure the signal 
beam's phase quadrature and thus, as in the other adap- 
tive schemes, set $(£) = <fi(t) + ir/2. The reason we use 
the label 'semi-optimal adaptive' is that, whilst we use 
quantum parameter estimation in determining (f>, we are 
not certain that it is always best to attempt to measure 
the phase quadrature. Perhaps, one could obtain a more 
accurate estimate by occasionally trying to measure the 
amplitude quadrature, for example. 

IV. RESULTS 

To compare the accuracies of the estimates introduced 
in Section IIHI we now calculate their average errors as 



measured by the Holevo variance V of the difference 
between the actual phase <fi an d our estimate 4>. Typi- 
cally, this quantity fluctuates for some time before set- 
tling down to a fixed steady-state value. Intuitively, this 
occurs as a balance arises (on average) between the infor- 
mation we gain about </> from a new photocurrent mea- 
surement and that we lose due to </>'s phase diffusion over 
the measurement's duration. We choose this steady-state 
value of V H ((j) — </>), denoted by Vgg, as our measure of 
the efficacy of our phase-estimation schemes and hence 
numerically determine it for all of them for a range of 
N values. We also obtain analytic expressions for it for 
some schemes for both large and small values of N. 

From the definition of the Holevo variance in 
Eq. lEHI, V H {(j) - <f>) is given by 

V H {4>-4>) = |( e ^)) S)J |- 2 -l, (4.1) 

where the average (. . is a stochastic average over £ 
and /. To calculate this quantity for our three estimates 
generated via parameter estimation, we first use the fact 
that 

(c i( *" ,) )«,/={|(e < *>p(«l)i (4-2) 

to express V H (cj) — (j>) as 

V H (d>-4>) = {\{e^) PW \)f-l. (4.3) 

A demonstration of Eq. 1)4.2)1 is given in Appendix B. 
After arriving at Eq. ()4.3)l . we then use the ergodic theo- 
rem within this equation to replace the ensemble average 
(|(e I?i )p(0) \)i in the steady-state by the temporal average 

—^-SS f d£\{e*(t)) PW \, (4.4) 

where is the time at which the steady-state regime 
begins and tf is the final time we consider (tf ^> t^ s ). 
This allows us to determine Vgg through simulating just 
a single stochastic trajectory. 

Upon calculating Vg S , a number of trends are appar- 
ent. The first of these concerns the proximity of <j> to (f> 
in the simple adaptive scheme. For large N, the initial 
estimate <j)(t = 0) for this scheme is usually some dis- 
tance from the actual phase (f>(t — 0). Then, as we gain 
more and more information via measurement and post- 
processing, <f> 'homes in' on <p during an initial period of 
transience. After this it 'locks onto' <fi, staying close to <p 
as it continues to fluctuate a little. This pattern of be- 
haviour is illustrated in Fig- EJi) - It is anticipated that all 
the schemes considered behave similarly, though we did 
not explicitly verify this. For small values of N, <f> never 
locks onto (j) but instead continues to fluctuates in its 
vicinity with a magnitude that increases with decreasing 
N, as highlighted in Fig. [3] ii) . 

A second trend in our results concerns the size of the 
interval within which we are fairly certain that <f> lies at 
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FIG. 3: Graphs showing typical variations of the actual phase 
4> (solid line) and our estimate <f> (dotted line) versus time t 
scaled by k for the simple-adaptive phase-estimation scheme 
for: i) a large photon flux (N = 1000) and ii) a small one 
(TV = 0.1). In i), (j> initially 'homes in' on (j>, before locking 
onto it. In ii), the low photon flux means we gain so little 
information from our measurements that <f) never locks onto 
<j>. Both <j> and (f> are dimensionless, as is t. 



any moment in time. This is measured by the Holevo 
variance |(e l< ^)p(0)|~ 2 — 1 which can be thought of as a 
measure of our lack of confidence in cf>. For large TV, this 
quantity, at least for the schemes based on parameter es- 
timation, only fluctuates over time by a small amount 
once the initial transience ends. This behaviour is illus- 
trated in Fig. Q i) . It can be explained by realising that 
when TV is large we are in a linear regime in the sense that 
the measured photocurrent, I r or I c , is a linear function 
of the actual phase (/). For instance, for homodyne de- 
tection we have I T dt = 2r]\a\((j) —</>) + ^/rjdW. It is a 
characteristic trait of such linear systems that our level 
of confidence (and hence also our lack of confidence) in 
any estimate of a system parameter is constant in the 
steady state [2^. For small TV, however, | (e^) p(^) |~ 2 — 1 
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FIG. 4: Graphs showing typical variations of our lack of con- 
fidence in <f> as measured by |(e 1 ^ > }p(0) | -2 — 1 (dimensionless) 
versus time t (dimensionless) scaled by k for the semi-optimal 
adaptive phase-estimation scheme for: i) a large photon flux 
(TV = 1000) and ii) a small one (N = 0.1). 

fluctuates appreciably for all t (for the schemes based on 
parameter estimation), as shown in Fig. J3J ii). 



A. Non-adaptive schemes 

1. Berry-Wiseman heterodyne-based 

Previous work [13 has calculated Vt? s for the BW 
heterodyne-based scheme for a range of ./V values. These 
results are plotted in Fig. [SJ For large TV, it is known 
[Io| that the scheme has a steady-state error of Vgg ~ 
1/V2TV. 



2. Optimal heterodyne-based 

For the optimal heterodyne-based phase-estimation 
scheme, Vgg was calculated by determining the tempo- 
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ral average in expression 14 . 41 and then using Eq. (|4.3ll to 
find Vgg. This was done by, first, expressing P(<f>) in 
Eq. 12.21JI as the following discrete Fourier series: 



P{4>) = & J ex Pfe'^)' 



(4.5) 



where bj S C and b^j — b*. Next, the resulting equa- 
tion was transformed into Fourier space to produce the 
following coupled differential equations: 



h = - 



nfbj 



|a|C6j-i + \a\Cb j+1 - 47r&,-|a|Re(C&i) 



. (4 ' 6) 

These were then numerically solved by only considering 
bj's for which \j\ was less than some finite bound that 
increased with N. Next, (e 1 ^) p($)(t) was determined 
by exploiting the fact that it is a function of just one 
Fourier coefficient Finally, we averaged over nu- 

merous steady-state values of (e^} P ^(t) to obtain ex- 
pression H4.4fl and thus Vg S . The results generated are 
plotted in Fig. [3J Analytic results were also found for 
large and small N which are Vgg ~ 4/(ttN) (small N) 
and Vf s ~ 1/V2N (large N). 

Our analytic result for Vg S for small N was obtain- 
ing by first realising that when N <C 1 heterodyne mea- 
surements on the signal beam yield little information 
about (j) an d thus P{4>) is broad. This means that, in 
contrast, P(</>)'s Fourier transform is narrow and, more 
specifically, that the following relations hold (on average) 
|&o| \bi\ ^> I&2I — Because of this, we can neglect 
Fourier coefficients for which \j\ > 1 in Eq. I|4.6[) . Upon 
doing this, and also neglecting terms containing |6i| 2 (as 
I bi 1 2 <C |&i|), we are left with just the following equation 
for b\ 



~dt 



nb\ 
~2 



2vr ' 



(4.7) 



Note that bo(t) = \/(2tt) (as can be determined from the 
normalisation condition J^l^° P(<p)d<fi — 1). Solving 
Eq. (|4.7|l we find that in the steady-state b\ is a complex 
Gaussian random variable with mean zero and a variance 
of N/(8tt 2 ) in both its real and imaginary parts. 
To calculate Vgg from b\ we first note that 



lp(<j>)\ 



4>o+2tt 



#e^P(0)|. 



(4.8) 



4>=<po 



Substituting P(4>) — J^'jL-oo bj exp(zj^) into the right- 
hand side of this equation yields 

l(e i0 )p(«| = 2tt|6i|. (4.9) 
From this it follows that the equation 

^ = «K^)p W I)7 2 -i> (4-io) 

simplifies to 

y<f s = (2^(|6 1 |) c )- 2 -l. (4.11) 



Given that (|6i|) c ~ VN/(4y/n) we obtain Vf s ~ 
A/(ttN) — 1. Neglecting the second term (as this pro- 
duces a more accurate approximation) yields 



v ss 



4/(7rJV). 



(4.12) 



The large-A^ approximation for Vgg for the optimal 
heterodyne scheme was obtained by replacing the expo- 
nents in Eq. (|2.21J) by a linear approximation and then 
assuming that P{4>) was Gaussian. Differential equations 
with respect to time for the mean and variance of this 
Gaussian were then constructed and solved to obtain the 
standard variance of 4> in the steady state which, for large 
N, is approximately equal to Vgg. 

The expression (e^ - (e^) P(0) ) C in Eq. ^TT^ can be 
re-expressed as 



(4.13) 



When N ^> 1, the large photon fluxes present in the 
signal beam mean that our measurements yield a great 
deal of information about <fi an d hence that is a highly 
accurate estimate. As a result, e l ^~^ ~ 1 and thus we 
can linearise expression (|4.13|) as follows 



) P(0) ) e ^C^(<M0)p(0))e # C (4-14) 



The expression e l ^C behaves as complex Gaussian white 
noise and hence we denote it as Substituting the above 
results into Eq. 12.21JI . we obtain 



dP(<t>) 



nd 2 P 



dt-2\a\(i(4>-(<f>) PW )Re(C'))dt. 

(4.15) 



To solve this equation, we assume that P((f>) is Gaussian 
and thus that it can be expressed as 



P{4>) = 



exp[-(0- MP ) 2 /(2a p )] 



2nap 



(4.16) 



where \ip and ap are, respectively, P's mean and vari- 
ance. Generating differential equations for \ip and crp, 
we obtain, 



dap 



rf(0 2 )p(0) -d((4>) PW ) 

d((f> 2 ) p(4>) - 2((f>) PW d((f>) PW - (d((f))p {<t>) y 



and 

dfip = -2\a\a%Re(C)dt. 
Solving these yields 

1 exp(2 v / 2|a| 2 i/ViV) + 1 



a P (t) 



/ 2]Vexp(2\/2|a| 2 t/v / A 7 ) - l' 



(4.17) 



(4.18) 



(4.19) 
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In the limit of t 



oo this reduces to 
1 



(4.20) 



Interestingly, this result is the same as that obtained in 
[To| . This shows that the BW heterodyne-based scheme, 
which was designed for large N, is indeed optimal in this 
regime. 



3. Canonical 

For the canonical phase estimation scheme, 4>{t) was 
calculated via quantum parameter estimation using the 
method in Subsection III CI For this scheme, Bayes' rule 
is 



P(<f>)P(Q\4>) 

p(o) ■ 



(4.21) 



where 9 is the measured phase. As a canonical phase 
measurement is a projective measurement of the Pegg- 
Barnett phase observable pj, the probability of it yield- 
ing the result 9 is (2tt)^ 1 times the square of the norm of 
the measured state's projection onto the (unnormalised) 
phase eigenstate \9) = J2^Lo e m9 \n). Thus, for the co- 
herent states we consider, to first order in -\fd~t, 



P(0\tf>) 



1 

27 



\{aVdt\9}\' 



-!- (l + 2|a|\/cftcos(6> -</>)) (4.22) 



and thus 



P(0) W =o = (2tt)- 



(4.23) 



Substituting the expressions on the right-hand sides of 
Eqs (|4"2H) and into Eq. (Q]} leads to the follow- 

ing Zakai equation 



dP((f>) = l±(e i ^- a) + c.c.)P(<f>)dt. 
Vdt 



(4.24) 



Using the known correspondence detailed in Appendix 
A, this, in turn, leads to the KS equation 



dP{<j>) = 2x 

Re | lal 



{e *4>)_ {e i* )pw) e P{(j)) 



dt 



(4.25) 

Letting e~ l6 ' /Vdt = /, we find that (/) = (/ 2 ) = 
(at least when we average over any finite time interval) 
and (//*) = 1/dt from which it follows that / is com- 
plex Gaussian white noise. Given this, Eq. (|4.25|1 reduces 
to Eq. H2.20I) . the KS equation obtained for the optimal 
heterodyne-based phase-estimation scheme. As a result, 
the canonical scheme shares the same accuracy as this 
other scheme and so shares the same results for Vgg. 
This surprising result is explained in Section Ivl 
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FIG. 5: Log-log plot of the steady-state Holevo variance 
Vgg versus photon flux N for the BW heterodyne-based 
non-adaptive (squares) and optimal heterodyne non-adaptive 
(solid line) phase estimation schemes. Both Vgg and N are 
dimensionless. 



4- Comparison 

As can be seen from Fig. |SJ when N < 10, the 
optimal heterodyne-based phase-estimation scheme is 
slightly more accurate than the BW heterodyne-based 
one. For larger values of N, however, we see that both 
schemes seem to be equally accurate. (At approximately 
N = 10 125 , the BW heterodyne-based scheme appears to 
be more accurate, but this is due to numerical errors, pri- 
marily in the BW heterodyne-based result.) The first of 
these features illustrates that while the BW heterodyne- 
based scheme is close to optimal for N < 10, <f> can be 
estimated more accurately using parameter estimation in 
this regime. The latter fact is particularly significant as 
this regime is the one in which an experimental reali- 
sation could most readily be performed, as discussed in 
more detail in Sect.[V] The second feature highlights that 
the BW heterodyne-based scheme is optimal for TV > 10 
which is unsurprising as it was designed for large N |10| . 



B. Adaptive schemes 

1. Simple adaptive 

For the simple adaptive phase-estimation scheme, the 
Holevo variance in the steady-state was calculated by 
simulating the evolution of 4>(t) via solving Eq. I|2.2I) and 
also simulating the measurement outcomes on the beam 
using Eq. I|2.4|l to obtain a numerical expression for I T (i) 
for a range of times. This allowed us to update 



via 



dt 



(4.26) 
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and thus to determine (pit) — 4>(t), again for a range of 
times. The local-oscillator phase &(t) was then set to 
<f>(i) = 4>(t) + 7r/2. The steady-state Holevo variance Vgg 
was calculated from the difference <j>(t) — 4>(t). 



2. Berry- Wiseman adaptive 



For the BW adaptive scheme, Ref. [Tl| determined 
Fig.© 



Vgg as a function of N and these results are shown in 



3. Semi-optimal adaptive 

We derived <fi for the semi-optimal adaptive scheme via 
quantum parameter estimation in the same manner as for 
the optimal heterodyne and canonical schemes. For this 
scheme, Bayes' rule is 



(4.27) 



Replacing the normalisation constant P(I T ) by P (I r )\ a \=o 
yields the quasi-Bayes rule 



14) = 



P(lr)\ a \=0 ' 



(4.28) 



From Eq. (|2.4|l we know that J r is a Gaussian random 
variable with variance 1 / (rft) and mean 2|a|cos(0 — $) 
from which it follows that (for r\ = 1) 



di 



P(<j>\I x ) = — exp [-dt{I x - 2|a|cos(0- $)} 2 ] 



and 



P(It)\ a \*o = -cxp(-dt/ r 2 ) 



(4.29) 



(4.30) 



Substituting these two results into Eq. 14.28fl . we obtain 
the following Zakai equation 



dPty) = |a|(e iW -* 3 J r + c.c.)P(^)dt 



(4.31) 



Using the known correspondence detailed in Appendix A 
and including the effects of phase diffusion, Eq. (|4.31|) 
leads to the KS equation 

dP(0) = 1 -^dt + 

\ a \ f( e i(*-*) _ ( e <(*-*)) PW )p(^)C'(i) +c.c 



(4.32) 



where £' is reaZ Gaussian white noise given by £' = It 
2|a|(cos(0-$)) PW . 
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FIG. 6: Log-log plots of the logarithm of steady-state Holevo 
variance Vgg versus the photon flux N for the BW adaptive 
(squares) and the semi-optimal adaptive (solid line) phase- 
estimation schemes. Both Vss and -/V are dimensionless. 



To obtain Via from Eq. I|4.32|l we applied the same 
method used for the optimal heterodyne-based scheme 
centred around decomposing P(<fi) via the Fourier decom- 
position in Eq. I|4.5|l . The results obtained are plotted in 
Fig. [fjj In addition, for small and large N the following 
analytical results were found: 



V S H S ~1/(2VN) (large N) 



V" s - 1 / N (small N). 



(4.33) 



(4.34) 



These results were obtaining via calculations very similar 
to those used in Sect. II V A2l to obtain the corresponding 
estimates for optimal heterodyne detection. 



4- Comparison 

Figs and \7\ display a number of interesting features 
which we now highlight. First, Fig. [7| shows that the 
semi-optimal adaptive and simple adaptive schemes are 
equally accurate, as evidenced by the fact that they have 
identical V^-versus-AT plots. Second, Fig. illustrates 
that the semi-optimal adaptive scheme (and hence also 
the simple adaptive scheme) is more accurate than BW 
adaptive one for all values except when > 10 3 ' 5 . 
Third, Fig. [7] demonstrates that the semi-optimal adap- 
tive scheme is significantly more accurate than the op- 
timal heterodyne-based or canonical schemes. Fourth, 
Fig. also shows that adaptive phase estimation is more 
accurate than any non-adaptive phase-estimation scheme 
in which the field is measured in real time. The rea- 
son for this is the following: Assume that we measure 
the field non-adaptively in real time. By this we mean 
that we measure it via a continuous sequence of iden- 
tical infinitesimal-time measurements and thus measure 
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FIG. 7: Log-log plots of the logarithm of steady-state 
Holevo variance V^s versus the photon flux iV for the op- 
timal heterodyne-based (solid line), semi-optimal adaptive 
(squares) and simple adaptive phase-estimation schemes (dot- 
ted line with *'s). The large N and small N results lie upon 
the asymptotes derived for these regions. Both Vg S and N 
are dimensionless. 



each spatial 'segment' of the signal beam as it is inci- 
dent on the detector. In this scenario, the best measure- 
ment we can make is a canonical phase measurement (as 
we must decide what to measure whilst knowing nothing 
about the phase). However, from Subsection IIII A 3l we 
know that estimating <j> non-adaptively via such a mea- 
surement leads to an estimate only as accurate as that 
of the optimal heterodyne-based scheme. We also know 
that adaptive phase-estimation is more accurate this lat- 
ter non-adaptive scheme in the cw scenario and hence 
it is also more accurate than the canonical non-adaptive 
scheme. 



V. DISCUSSION 

The results of Section llVI display a number of interest- 
ing features which we now discuss. First, it might seem 
puzzling that the canonical phase-estimation scheme is 
only as accurate as the optimal heterodyne-based scheme 
and is not, instead, the most accurate scheme. Given that 
a canonical phase measurement is generally thought to be 
the best measurement of phase we can make, why isn't 
the canonical scheme the most accurate? The answer to 
this lies in the details of the scenario we consider. In the 
standard scenario in which we wish to estimate phase, 
we make a single phase measurement on a system for 
which we have no prior information about the phase. In 
this scenario, a canonical measurement is optimal. How- 
ever, in the scenario we consider prior to making a mea- 
surement on the field at time t ^ 0, we already know 
something about cj>, as evidenced by the fact we possess 
a non-trivial probability distribution P{<j>). This prior in- 



formation can be exploited by measurements other than 
a canonical one to yield more information about phase 
than would a canonical measurement. 

To understand the preceeding point it may be helpful 
to consider the following example: Say we wish to deter- 
mine as accurately as possible the phase of a system in 
a weak coherent state which we know to be either one of 
the two states \ip±) = |0) + 7e ±4 ^|l), where 7 G R < 1, 
with equal probability. In this instance, because we al- 
ready know something about cj>, we can tailor the mea- 
surement in accordance with this prior knowledge and 
measure the $ = ir/2 or Y quadrature to obtain slightly 
more information about <p than would a canonical mea- 
surement. Specifically, measuring the Y quadrature, we 
estimate <fi correctly with probability 1/2 + 0.7997 sin </>, 
while for a canonical measurement this probability is only 
1/2 + 0.6387 sin</>. 

Another interesting feature related to Section iTVl s re- 
sults concerns the main conclusion we drew from them, 
which was that adaptive phase estimation in the cw sce- 
nario is more accurate than any non-adaptive scheme in 
which the field is measured in real time. Though we were 
able to arrive at this result, we are uncertain if adaptive 
phase estimation is better than any non-adaptive scheme 
at all. This is because it is conceivable that there exists 
a non-adaptive scheme in which, instead of measuring 
the field in real time, we store up a portion of it over a 
period of time and then measure the accumulated field 
as a whole that is more accurate than adaptive phase 
estimation. 

The results of Section I1VI also show that simple adap- 
tive does as well as semi-optimal adaptive. Why does this 
relatively uncomplicated scheme do so well? One possi- 
bility is that the state of the beam we consider, being 
based on coherent states, is somewhat 'simple'. Perhaps, 
it does not allow us to fully utilise the power of the more 
complicated semi-optimal adaptive scheme. 

One final interesting feature of Section II Vi s results 
concerns the variation with N of the relative superi- 
ority of adaptive phase estimation over real-time non- 
adaptive phase estimation. This is measured by the 
ratio of the steady-states Holevo variances for the op- 
timal heterodyne-based and the semi-optimal adaptive 
schemes. For A^< 1, this ratio is given by Eqs l|4.12|l and 
and is 4/tt ~ 1.27 while for N > 1 it is y/2 ~ 1.41. 
For intermediate N values, it lies in between these two ex- 
tremes. Of particular importance is the fact that the gap 
is present for N ~ 1. This is because this regime is the 
most fertile for experimental implementation as within it 
the errors we wish to see are not swamped by technical 
noise. It is also noteworthy that the small- N ratio of 4/tt 
is significantly greater than the analogous ratio in Ref. 
[lfj, which was approximately 1.1, between the adaptive 
and non-adaptive estimates in this other paper. 

Having discussed the results in Section lTVI we now turn 
to two theoretical issues arising from our work. First, in 
this paper we have considered estimating the phase of an 
EM beam in a coherent state. However, other beams 
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could be investigated as was done in Ref. ]nj which 
looked at a so-called squeezed EM beam with a ran- 
domly fluctuating phase. That paper found that, for such 
a beam, adaptive phase estimation was more accurate 
than heterodyne-based non-adaptive phase-estimation 
not just by a constant factor (as this paper has), but 
by a factor scaling with N. In particular, it found that 
for such a beam the steady-state Holevo variance of the 
error scaled as TV -2 / 3 in adaptive phase-estimation but 
only as TV -1 / 2 in heterodyne-based non-adaptive phase 
estimation. 

While this result for squeezed beams is interesting, the 
calculations behind it contained a number of deficien- 
cies. First, Ref. ^3] considered a beam with broadband 
squeezing, i.e. one that was squeezed at all frequencies, 
and thus the noise present in the beam had infinite en- 
ergy. The parameter N — \a\ 2 / n was finite, however, as 
it relates only to the energy carried by the mean field. 
Such a beam is unphysical and, furthermore, constitutes 
an inappropriate theoretical model for the problem con- 
sidered, as we shall soon see. The second deficiency in 
the calculation was that it involved estimating <f>{t) using 
only information about the beam's signal. This meant 
that information in the beam's noise was ignored. If such 
information had been used then, as the noise had infinite 
energy, we could have instantly determined (j) by deter- 
mining the relative sizes of the noise in different quadra- 
tures. Thus, the calculation in Ref. 01 ignored obtaining 
phase information from a potential source (the noise) and 
revolved around a model such that if we do consider this 
potential source, we find that we can instantly determine 
<p(t) with perfect accuracy, which is unrealistic. Because 
of these deficiencies, we feel that it is desirable to do ad- 
ditional calculations on squeezed beams. We anticipate 
that our 'optimal' approach to obtaining phase estimates 
based on quantum parameter estimation may be useful 
in such calculations. 

A second theoretical issue arising from our work is the 
following: Throughout the paper, it was assumed that 
|a| was known precisely. However, even if we only know 
that | a | > a, where a S K we can still do at least as 
well as when we know that it equals a. This follows on 
from work by Stockton et al. [26j (Sect. V). Knowing 
|a| precisely, we have, for the simple adaptive (and semi- 
optimal adaptive) schemes, 



Xopt = 2 v / /t|< 



(5.1) 



If we only know that |a| > a we can set x equal to 

X = 2y/^a. (5.2) 

For N > 1, this leads to [HI 

V^ s = 2V^a/(8a 2 ) + vV( 2a ) ( 5 - 3 ) 
~ v/ic/(2o). (5.4) 

That is, we can estimate <fi at least as well as we can as- 
suming we know that |a| is exactly the minimum known 
value. 



VI. CONCLUSION 

Quantum phase estimation and, in particular, Bayes' 
rule were used to find optimally accurate phase estimates 
and to show that, for a continuous EM beam with a ran- 
domly fluctuating phase, adaptive phase-estimation is 
more accurate than any non-adaptive phase-estimation 
scheme in which the field is measured in real time. 
Though it is more accurate for all photon fluxes it is, 
in particular, more accurate for such beams possessing 
small to moderate photon fluxes. This is important as 
this is the regime in which in experiments would have 
the greatest chance of confirming any theoretical differ- 
ence between the two types of phase-estimation schemes. 



VII. APPENDIX A 

This section details the known correspondence between 
a Zakai equation of the form 

dP = {XI + c.c.) Pdt (7.1) 

and the KS equation 

dP= [(X - (X) P )(I- (I) P )+c.c.}Pdt. (7.2) 

To obtain Eq. I7.2|l from Eq. (|7.1|) . we begin with the 
identity 



P(0) + dP{4>) = 



p + dP 



(7.3) 



Taking out a factor of J, d<j)P(<j)) in the denominator 
leads to 



P{4>) + dP{cj>) 



p + dP 



1 



Id 



(7.4) 



Expanding the expression in the denominator within the 
square pararentheses as a power series using the binomial 
theorem ((1 + x) n — 1 + nx + n(n — l)x 2 /2 +...), yields 



P^+dP^) 



P + dP 



d(j)P 



(7.5) 

Normalising the distribution P using the factors of 
J, d(j)P((j)) in the denominator and also substituting in 
the expression for dP in Eq. (|7.1(l . we obtain 

P + dP = (P+(XI + c.c.)Pdt) 

\ I I / d(f>(XI + c.c.)P(<p)dt 

2\ 



dcj)(XI + c.c.)P(4>)dt 



(7.6) 
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(P + (XI + c.c.)Pdt) (1 Substituting this result into Eq. 18.4(1 and integrating 

over £ yields 



-{(X)pl + c.c.)dt + {{X) P I + c.c.)*) 2 ) 



(7-7) , , 

Expanding this expression and keeping only terms of or- Ji J v 

der dt or less, we arrive at Eq. (|7.2(l . (8.6) 



VIII. APPENDIX B 



Using elementary probability theory, we obtain 



In this appendix we demonstrate that, for the schemes 
based on quantum parameter estimation (the optimal 
heterodyne-based, the canonical and semi-optimal adap- 
tive schemes), 



{e i «-b)u = (\{e i *) PW \) 1 



.1) 



By definition 



(c*W-*)) C)J = 



d£dIP(£, I)e 



(8.2) 



Expressing e 1 ^^' as an integral over the dummy 

phase variable ip, we obtain 

e im)-iW) = f dipS((f>^)-ip)e l[v -^ I)] . (8.3) 



Substituting the right-hand side of Eq. ((8.3(1 into the 
right-hand side of Eq. 1(8.2(1 yields 



d£did<pP(OP(i\OHHO - <py lip -t {1)] . 



(8.4) 



Assuming we know the so-called process noise £, then 
we know the phase <f> exactly and thus our probability 
density function for is a Dirac delta. From this it fol- 
lows that 



p{i\mm-v)d<p=p(v,mdy. 



(8.5) 



dIP {I) / dtpP{ip\iy [ip ~^ I)] . 



•7) 



Given that 



0(/)=arg / d^P{y'\iy* 



(8.8) 

where ip' is a second dummy phase variable, Eq. 1(8.7(1 
leads to 



( e *(0-0)) Cj/ = JdIP{I) 



dtpP(tp\I)e ilp 



(l(e^)pMl)/- 



(8.9) 



Upon replacing ip by <j) in the final expression, where <p 
now acts as a dummy phase variable, Eq. 1(8.1(1 is ob- 
tained. 
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Name of Measurement Scheme 


4> 




Type of Detection 


Canonical 


arg ((exp(i0)) PW ) 


N/A 


canonical 


Optimal heterodyne-based 


arg((exp(i0)) P(w ) 


A 


heterodyne 


BW heterodyne-based 


arg (At) 


A 


heterodyne 


BW adaptive 


arg (A t + X B t A* t ) 




homodyne 


semi-optimal adaptive 


arg ((exp(i0)) P( ^) 


d<j>/dt 


homodyne 


simple adaptive 


arg (At) 




homodyne 



TABLE I: Summary of phase estimates 



